Identification of cDNA Sequences by Specific Oligonucleotide Sets - A Computer Tool and Application
Abstract
A computer tool has been developed for finding sets of oligonucleotides that are invariant within isofunctional families of DNA (RNA) sequences and for using these sets in the functional identification of nucleotide sequences. The tool allows one to: build vocabularies of invariant oligonucleotides for families of isofunctional nucleotide sequences; assess the significance of the vocabularies; identify nucleotide sequences using the vocabularies of invariant oligonucleotides; determine the identification parameters that minimize first and second type errors; assess how efficiently individual isofunctional families are identified with their oligonucleotide vocabularies; and determine the evolutionary characteristics of the families on which vocabulary size depends. Using this system, we analyzed a total of 322 protein-encoding gene families and built sets of invariant oligonucleotides (oligonucleotide vocabularies) that are characteristic of gene families and subfamilies. We show that nucleotide sequences belonging to these families can be identified with these sets of invariant oligonucleotides. Under the most effective identification parameters, the first type error (false negative) on independent control data was 10-15%, and the second type error (false positive) was only 1-2 redundant sequences per sequence examined. The size of a vocabulary of invariant oligonucleotides is shown to depend on the percentage of variable positions in the multiple alignment within a family.
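The idea of a vocabulary of invariant oligonucleotides can be illustrated with a minimal Python sketch. It is not the authors' tool: the abstract does not give the oligonucleotide length, the matching criterion, or the decision threshold, so the parameter `k`, the exact-substring matching on unaligned sequences, the `match_fraction` score, the `threshold` cutoff, and all function names here are assumptions made for illustration only.

```python
from typing import Dict, Iterable, List, Set


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings (oligonucleotides) of seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_vocabulary(family: Iterable[str], k: int) -> Set[str]:
    """Oligonucleotides of length k that occur in every sequence of the family.

    Assumption: "invariant" is read here as "present in all family members".
    """
    sets = [kmers(s, k) for s in family]
    if not sets:
        return set()
    return set.intersection(*sets)


def match_fraction(query: str, vocabulary: Set[str], k: int) -> float:
    """Fraction of the vocabulary's oligonucleotides found in the query."""
    if not vocabulary:
        return 0.0
    return len(vocabulary & kmers(query, k)) / len(vocabulary)


def identify(query: str, vocabularies: Dict[str, Set[str]],
             k: int, threshold: float) -> List[str]:
    """Names of families whose vocabulary is matched at or above threshold."""
    return sorted(name for name, vocab in vocabularies.items()
                  if match_fraction(query, vocab, k) >= threshold)


# Toy sequences invented for this example (not real gene families).
family_a = ["ACGTACGGA", "TTACGTACG", "GACGTACGC"]
family_b = ["GGATCCAT", "AGGATCCA"]
K = 5  # hypothetical oligonucleotide length

vocabularies = {
    "family_a": build_vocabulary(family_a, K),
    "family_b": build_vocabulary(family_b, K),
}
hits = identify("CCACGTACGTT", vocabularies, K, threshold=0.8)
```

In this sketch, raising `threshold` trades false positives for false negatives, which is the kind of parameter choice the abstract describes when it speaks of minimizing first and second type errors.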
Journal:
- Proceedings. International Conference on Intelligent Systems for Molecular Biology
Volume 3
Publication date: 1995